Video file processing method, video file processing device and monitoring system

ABSTRACT

A video file processing method for making and outputting an event film according to video files is provided. The method includes: acquiring at least one video file and detecting a plurality of events and an occurring time of each event in the video file; extracting a plurality of event clips including the events from the video file according to the occurring time; obtaining an event film by assembling a plurality of concerned clips in the event clips according to a merging rule; and outputting the event film. In addition, a video file processing device and a monitoring system using the method are also provided.

CROSS-REFERENCE TO RELATED APPLICATION

This application claims the priority benefit of Taiwan application serial no. 107144889, filed on Dec. 12, 2018. The entirety of the above-mentioned patent application is hereby incorporated by reference herein and made a part of this specification.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a film editing technology, in particular, to a video file processing method, a video file processing device, and a monitoring system.

2. Description of Related Art

In recent years, populations have been aging rapidly. In a busy city, children are not able to take care of their elders at close range at all times. Therefore, home care is becoming more and more important, and home care institutions and care dispatch centers have emerged accordingly. Even in institutions for aged care or patient care, caregivers cannot take care of all elders or patients one to one. Therefore, these institutions often set up monitoring systems, such as a plurality of cameras, in an indoor environment to monitor elders or patients.

However, although the monitoring system records the video of each camera, the video content is not well analyzed or organized. Therefore, when there are accidents or abnormal events such as falls, a caregiver or manager needs to spend a lot of time and energy searching the recorded video files for the segments concerned. When there is a plurality of cameras, it is even more difficult to rapidly correlate the images across the plurality of video files to find out the occurrence and cause of an event. In addition, the synchronized and overlapping storage of a plurality of video files from a plurality of cameras may also consume or waste considerable storage space.

SUMMARY OF THE INVENTION

In view of this, the present invention provides a video file processing method, a video file processing device, and a monitoring system, which can assemble a plurality of video files more efficiently.

The video file processing method according to an embodiment of the present invention includes: acquiring at least one video file and detecting a plurality of events and an occurring time of each event in the at least one video file; extracting a plurality of event clips including the plurality of events from the at least one video file according to the occurring time; obtaining an event film by assembling a plurality of concerned clips in the plurality of event clips according to a merging rule; and outputting the event film.

The video file processing device according to an embodiment of the present invention includes a storage element and a processor. The storage element stores a plurality of modules. The processor is coupled to the storage element, and configured to access and execute the modules stored in the storage element. The modules include a video file acquiring module, an event detecting module, a clip extracting module, a clip assembling module, and a film outputting module. The video file acquiring module is configured to acquire at least one video file. The event detecting module is configured to detect a plurality of events and an occurring time of each event in the at least one video file. The clip extracting module is configured to extract a plurality of event clips including the plurality of events from the at least one video file according to the occurring time. The clip assembling module is configured to obtain an event film by assembling a plurality of concerned clips in the plurality of event clips according to a merging rule, where the merging rule includes a plurality of pre-defined reasonable event sequences, and the events in the event film conform to the plurality of reasonable event sequences. The film outputting module is configured to output the event film.

The monitoring system according to an embodiment of the present invention includes at least one camera and a video recording device. The at least one camera is configured to capture at least one video file. The video recording device is coupled to the at least one camera, and configured to: receive and record the at least one video file; acquire the at least one video file and detect a plurality of events and an occurring time of each event in the at least one video file; extract a plurality of event clips including the plurality of events from the at least one video file according to the occurring time; obtain an event film by assembling a plurality of concerned clips in the plurality of event clips according to a merging rule, where the merging rule includes a plurality of pre-defined reasonable event sequences, and the events in the event film conform to the plurality of reasonable event sequences; and output the event film.

Based on the foregoing, the video file processing method, the video file processing device, and the monitoring system provided by the embodiments of the present invention can analyze a plurality of event clips from video files captured by one or more cameras, and organize a portion of the event clips into an event film through a well-defined merging rule. Accordingly, a user can find required segments more rapidly, thereby improving convenience and user experience.

In order to make the aforementioned and other features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in detail below.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of a monitoring system according to an embodiment of the present invention.

FIG. 2 is a schematic diagram of a video file processing device according to an embodiment of the present invention.

FIG. 3 is a flowchart of a video file processing method according to an embodiment of the present invention.

FIG. 4 is a schematic diagram of assembling event clips of video files of a same camera according to an embodiment of the present invention.

FIG. 5 is a schematic diagram of obtaining an event film by assembling a plurality of concerned clips according to an embodiment of the present invention.

FIG. 6 is a schematic diagram of obtaining an event film by assembling a plurality of concerned clips including different people according to an embodiment of the present invention.

DESCRIPTION OF THE EMBODIMENTS

Some embodiments of the present invention are described in detail below with reference to the accompanying drawings. When the same element symbols appear in different drawings in the following description, they will be regarded as the same or similar elements. These embodiments are merely a part of the present invention and do not disclose all of the implementable modes of the present invention. Rather, they are merely examples of the method, device, and system within the scope of the patent application of the present invention.

FIG. 1 is a schematic diagram of a monitoring system according to an embodiment of the present invention.

Referring to FIG. 1, the monitoring system of the present embodiment includes a plurality of cameras 110A, 110B, and 110C, a network video recorder (NVR) 120, a video file processing device 130, and a cloud database 140. The plurality of cameras 110A, 110B, and 110C may be coupled to the network video recorder 120 in a wired or wireless manner. In addition, the video file processing device 130 may be coupled to the network video recorder 120 and the cloud database 140 in a wired or wireless manner. In the present embodiment, video files captured by the plurality of cameras 110A, 110B, and 110C may be recorded in the network video recorder 120, and the video file processing device 130 acquires a video file from the network video recorder 120, processes the video file to generate an event film, and then uploads and stores the event film into the cloud database 140. However, in other embodiments of the present invention, the network video recorder 120 and the video file processing device 130 may also be integrated into one video recording device. In other embodiments of the present invention, the video files captured by the plurality of cameras 110A, 110B, and 110C may also be recorded in the video file processing device 130.

In the present embodiment, the plurality of cameras 110A, 110B, and 110C are disposed at different positions respectively and configured to capture environments of different positions and/or angles to generate video files respectively. For example, the camera 110A is disposed adjacent to a doorway of a room to capture a person entering and leaving the room; the camera 110B is disposed on a wall or ceiling of the room to capture a person walking in the room; and the camera 110C is disposed at a bedside in the room to capture a person who goes to bed, gets out of bed, or lies in bed.

Although the monitoring system of the present embodiment has been described with three cameras as an example, the present invention is not limited thereto. In other embodiments, the monitoring system may also include one, two, four, five, or more cameras.

In the present embodiment, the network video recorder 120 is configured to collect, integrate, and manage the video files captured by the cameras 110A, 110B, and 110C. For example, the network video recorder 120 may include a storage device, and may organize the video files captured by each camera by using folders, file names, and the like.

In the present embodiment, the video file processing device 130 is, for example, an electronic device having a video file processing function, such as a personal computer, a notebook computer, a smart phone, a tablet computer, or a personal digital assistant. However, the present invention is not limited thereto. FIG. 2 is a schematic diagram of a video file processing device according to an embodiment of the present invention. Referring to FIG. 2, the video file processing device 130 includes a processor 131 and a storage element 132, where the processor 131 is coupled to the storage element 132. The storage element 132 stores a video file acquiring module 132a, an event detecting module 132b, a clip extracting module 132c, a clip assembling module 132d, and a film outputting module 132e. Each of the foregoing modules is, for example, a computer program, which may be loaded and executed by the processor 131 to perform the video file processing functions in the embodiments of the present invention. It should be noted that the storage element 132 described in the foregoing embodiments is not limited to a single storage, and the foregoing modules may also be separately stored in two or more storages of the same or different types.

In the present embodiment, the video file processing device 130 generates event clips, event films, and the like by processing the video files, and the video file processing device 130 may upload the event clips and the event films to the cloud database 140. In some embodiments, the cloud database 140 may be connected to a server. The foregoing server lists the event clips and the event films recorded in the cloud database 140, for example, in the form of web pages, and a user can click on a web page to download or view a selected event clip or event film online. However, the present invention does not limit the specific method for using the event clips and the event films recorded in the cloud database 140 herein.

FIG. 3 is a flowchart of a video file processing method according to an embodiment of the present invention. The method of the present embodiment is applied to the video file processing device 130 in FIG. 2, and the detailed steps of the video file processing method will be described below in conjunction with the various elements of the video file processing device 130.

First, the processor 131 acquires a video file and detects a plurality of events and an occurring time of each event in the video file (step S302). In detail, the video file acquiring module 132a acquires the video files captured by the cameras 110A, 110B, and 110C from the network video recorder 120, and the event detecting module 132b uses image analysis to detect whether a predetermined event occurs in the video file or not and to record an occurring time of the event. The foregoing events include, for example, entering and leaving a room, falling, going to bed, getting out of bed, getting up, lying down, and bed activities, and the like, and the present invention is not limited thereto.

In the present embodiment, the event detecting module 132b detects different events for video files from different cameras. For example, for the video file from the camera 110A at the doorway, the event detecting module 132b detects events such as entering or leaving the room; for the video file from the camera 110B on the wall or ceiling of the room, the event detecting module 132b detects a falling event; and for the video file from the camera 110C at the bedside, the event detecting module 132b detects events such as going to bed, getting out of bed, getting up, lying down, and bed activities. However, the present invention is not limited thereto. In other embodiments, the event detecting module 132b may also detect events without distinguishing the source of the video file.
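
By way of illustration only, this per-camera dispatch may be expressed as a simple mapping; the following Python sketch uses illustrative string identifiers for the cameras and events that do not appear in the embodiments:

    # Which predetermined events are detected in video files from which
    # camera, following the arrangement of FIG. 1 (illustrative only).
    EVENTS_BY_CAMERA = {
        "110A": ["enter", "leave"],                      # doorway camera
        "110B": ["fall"],                                # wall or ceiling camera
        "110C": ["go to bed", "get out of bed", "get up",
                 "lie down", "bed activity"],            # bedside camera
    }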

Then, the processor 131 extracts a plurality of event clips including the plurality of events from the video file according to the occurring time (step S304). In detail, the clip extracting module 132c extracts an event clip including an event from the video file according to the occurring time of the event. For example, if the event detecting module 132b detects that a first event occurs at a first occurring time in a first video file, the clip extracting module 132c extracts, according to the first occurring time, a first event clip including the first event from the first video file, where a time segment of the first event clip may include the first occurring time; and if the event detecting module 132b detects that a second event occurs at a second occurring time in a second video file different from the first video file, the clip extracting module 132c extracts, according to the second occurring time, a second event clip including the second event from the second video file, where a time segment of the second event clip may include the second occurring time.

In some embodiments, the clip extracting module 132c makes an event clip by taking 15 seconds before and after the occurring time of an event. For example, if the event detecting module 132b detects a falling event in a video file at 12:00:00, the clip extracting module 132c extracts the segment from 11:59:45 to 12:00:15 of the video file to serve as an event clip. However, the present invention does not limit the specific manner of making an event clip, as long as the extracted event clip includes the occurred event.
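
As a minimal sketch of this extraction step, assuming timestamps expressed in seconds and a simple data structure for event clips (all names below are illustrative and are not part of the embodiments), the logic may be written in Python as follows:

    from dataclasses import dataclass

    @dataclass
    class EventClip:
        camera_id: str   # camera that captured the source video file
        events: list     # detected events in order, e.g. ["fall"]
        start: float     # starting time of the clip, in seconds
        end: float       # end time of the clip, in seconds

    def extract_event_clip(camera_id, event, occurring_time, margin=15.0):
        # Take "margin" seconds before and after the occurring time of
        # the event; a fall detected at 12:00:00 thus yields a clip
        # covering 11:59:45 to 12:00:15.
        return EventClip(camera_id, [event],
                         start=occurring_time - margin,
                         end=occurring_time + margin)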

In the present embodiment, the clip assembling module 132d may also assemble event clips of video files from the same camera if the two event clips are temporally close and their images partially overlap. In detail, if the same camera captures two events within a short time, the clip extracting module 132c generates two event clips for the two events. If the clip assembling module 132d determines that the occurring times of the two events are quite close and the images of the two event clips partially overlap, the two event clips are assembled into one event clip; that is, the overlapping image portion is clipped from one of the two event clips, and then the two processed event clips are spliced. For example, the clip extracting module 132c extracts a first event clip including a first event from a first video file, and extracts a second event clip including a second event from a second video file. If the first video file and the second video file are the same video file or two different video files from the same camera, when a time difference between a first occurring time of the first event and a second occurring time of the second event is less than a preset threshold value, the clip assembling module 132d assembles the first event clip and the second event clip into an event clip simultaneously including the first event and the second event.

FIG. 4 is a schematic diagram of assembling event clips of video files of a same camera according to an embodiment of the present invention.

Referring to FIG. 4, the clip extracting module 132c makes, for example, an event clip by taking 15 seconds before and after the occurring time of an event. In the present embodiment, event clips V1 to V4 are all extracted by the clip extracting module 132c from the video files captured by the camera 110C, where the event clip V1 includes a going-to-bed event, the event clip V2 includes a lying-down event, the event clip V3 includes a get-up event, and the event clip V4 includes a getting-out-of-bed event.

Since a time difference between the two occurring times of the event clip V1 and the event clip V2 is less than a preset threshold value (such as 15 seconds), the time segments of the event clip V1 and the event clip V2 may overlap. Based on this, the clip assembling module 132d assembles the event clip V1 and the event clip V2 into an event clip V1+V2, which sequentially includes a going-to-bed event and a lying-down event. In the present embodiment, the starting time of the assembled event clip V1+V2 is the same as the starting time of the event clip V1, and the end time of the event clip V1+V2 is the same as the end time of the event clip V2. However, the present invention does not limit a specific mode of assembling herein.

Similarly, since a time difference between the two occurring times of the event clip V3 and the event clip V4 is less than the preset threshold value (such as 15 seconds), the time segments of the event clip V3 and the event clip V4 may overlap. Based on this, the clip assembling module 132d assembles the event clip V3 and the event clip V4 into an event clip V3+V4, which sequentially includes a get-up event and a getting-out-of-bed event.

In particular, the time length of the assembled event clip V1+V2 may be shorter than the total time length of the event clip V1 and the event clip V2, and the time length of the assembled event clip V3+V4 may be shorter than the total time length of the event clip V3 and the event clip V4. Therefore, time can be saved during playing. In the present embodiment, a time R1 saved by playing the assembled event clip V1+V2 compared to playing the event clip V1 and the event clip V2 is, for example, equal to the overlap time of the event clip V1 and the event clip V2; and a time R2 saved by playing the assembled event clip V3+V4 compared to playing the event clip V3 and the event clip V4 is, for example, equal to the overlap time of the event clip V3 and the event clip V4. In some embodiments, all event clips are temporarily stored in a temporary storage area of the video file processing device 130, so storage space can also be saved by assembling some event clips in advance.
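
Continuing the sketch above, the same-camera assembling performed by the clip assembling module 132d may be approximated as follows. The preset threshold value and the trimming of the overlapping portion follow the description of FIG. 4; comparing starting times rather than occurring times is an assumption of the sketch (the two are equivalent here because every extracted clip takes the same margin around its occurring time):

    def assemble_same_camera(clip_a, clip_b, threshold=15.0):
        # Only event clips from the same camera are assembled here.
        if clip_a.camera_id != clip_b.camera_id:
            return None
        first, second = sorted([clip_a, clip_b], key=lambda c: c.start)
        # Assemble only when the two clips are temporally close enough
        # that their time segments overlap.
        if second.start - first.start >= threshold:
            return None
        # The overlapping portion is kept only once, so the saved
        # playing time (R1 or R2 in FIG. 4) equals the overlap of the
        # two time segments.
        saved_time = max(0.0, first.end - second.start)
        assembled = EventClip(first.camera_id,
                              first.events + second.events,
                              start=first.start, end=second.end)
        return assembled, saved_time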

In addition, in some embodiments, the processor 131 first determines whether two events correspond to a same person or not, and only when the two events correspond to the same person are the event clips of the two events assembled. For example, the processor 131 first uses image analysis to determine whether the going-to-bed event in the event clip V1 and the lying-down event in the event clip V2 correspond to the same person or not, and if so, the event clip V1 and the event clip V2 are assembled into the same event clip V1+V2. Otherwise, if the going-to-bed event in the event clip V1 and the lying-down event in the event clip V2 correspond to different people, even if the time intervals of the event clip V1 and the event clip V2 overlap, they will not be assembled.

According to the foregoing mode, the processor 131 can acquire a plurality of event clips, and each event clip includes one or more events. After acquiring the plurality of event clips, the processor 131 obtains an event film by assembling a plurality of concerned clips in the plurality of event clips according to a merging rule (step S306). Specifically, the plurality of concerned clips refers to the event clips that are assembled into the same event film, and these concerned clips must conform to the merging rule. From another perspective, among the plurality of event clips, a plurality of consecutive event clips that conform to the merging rule may be referred to as a plurality of concerned clips. Details regarding the merging rule will be described in the following embodiments.

First, the clip assembling module 132d sorts the plurality of event clips according to a starting time of the plurality of event clips (step S3062), and then determines whether the plurality of events in a plurality of consecutive event clips conform to the merging rule or not (step S3064). If so, the clip assembling module 132d obtains the event film by merging the plurality of consecutive event clips in which the plurality of events conform to the merging rule (step S3066). The foregoing steps will be described below by way of an embodiment with reference to FIG. 5.
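
Expressed as a sketch under the same assumptions as above, steps S3062 to S3066 may look as follows; the conforms() predicate encoding the merging rule is sketched after Table 1 below, and how the consecutive event clips are grouped for checking is left open by the embodiments:

    def assemble_event_film(consecutive_clips, conforms):
        # Step S3062: sort the event clips by their starting time.
        clips = sorted(consecutive_clips, key=lambda c: c.start)
        # Step S3064: do the events in the consecutive event clips
        # conform to the merging rule?
        events = [e for c in clips for e in c.events]
        if not conforms(events):
            return None  # the clips remain separate event clips
        # Step S3066: merge the consecutive event clips into one event
        # film spanning the first starting time to the last end time.
        return EventClip(camera_id=None,  # a film may span several cameras
                         events=events,
                         start=clips[0].start,
                         end=clips[-1].end)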

FIG. 5 is a schematic diagram of obtaining an event film by assembling a plurality of concerned clips according to an embodiment of the present invention.

Referring to FIG. 5, event clips V0 and V6 are, for example, extracted by the clip extracting module 132c from the video files captured by the camera 110A, where the event clip V0 includes an entry (door) event, and the event clip V6 includes a leaving (door) event. The event clips V1+V2 and V3+V4 are, for example, the assembly results of the clip assembling module 132d in the embodiment of FIG. 4. The event clip V5 is, for example, extracted by the clip extracting module 132c from the video files captured by the camera 110B, where the event clip V5 includes a falling event. If sorting is performed according to the starting time, the five consecutive event clips are sequentially the event clip V0, the event clip V1+V2, the event clip V3+V4, the event clip V5, and the event clip V6.

If a plurality of events in a plurality of consecutive event clips conforms to the merging rule, the clip assembling module 132d merges these consecutive event clips together. In the present embodiment, the merging rule includes, for example, a plurality of pre-defined reasonable event sequences. If a plurality of events in a plurality of consecutive event clips does not conform to the foregoing reasonable event sequences, these events do not conform to the merging rule. For example, a plurality of reasonable event sequences defined in the merging rule is shown in Table 1 below.

TABLE 1

Number  Camera  Event           Reasonable event sequence
1       110A    Enter           First
2       110A    Leave           Last
3       110B    Fall            Between 1 and 4, or between 5 and 2
4       110C    Go to bed       Between 1 and 5, between 1 and 6, between 1 and 7, or between 1 and 8
5       110C    Get out of bed  Between 4 and 2
6       110C    Get up          Between 7 and 5
7       110C    Lie down        Between 4 and 5
8       110C    Bed activity    Between 7 and 6

Referring to Table 1, if a plurality of events includes an entry event, the entry event must be the first event among the plurality of events to conform to the reasonable event sequences; if a plurality of events includes a leaving event, the leaving event must be the last event among the plurality of events to conform to the reasonable event sequences; if a plurality of events includes a falling event, the falling event must be sorted between a No. 1 entry event and a No. 4 going-to-bed event, or sorted between a No. 5 getting-out-of-bed event and a No. 2 leaving event, to conform to the reasonable event sequences; and so on.
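
One possible encoding of the reasonable event sequences of Table 1, written as the conforms() predicate used by the assembling sketch above, is shown below; the pair encoding is one reading of Table 1 and not the only implementable mode:

    # For each event, Table 1 lists pairs (before, after): the event is
    # reasonably placed if it lies between at least one such pair.
    REASONABLE = {
        "fall":           [("enter", "go to bed"),
                           ("get out of bed", "leave")],
        "go to bed":      [("enter", "get out of bed"), ("enter", "get up"),
                           ("enter", "lie down"), ("enter", "bed activity")],
        "get out of bed": [("go to bed", "leave")],
        "get up":         [("lie down", "get out of bed")],
        "lie down":       [("go to bed", "get out of bed")],
        "bed activity":   [("lie down", "get up")],
    }

    def conforms(events):
        # An entry event must be first and a leaving event last, if present.
        if "enter" in events and events[0] != "enter":
            return False
        if "leave" in events and events[-1] != "leave":
            return False
        for i, event in enumerate(events):
            pairs = REASONABLE.get(event)
            if not pairs:
                continue
            if not any(before in events[:i] and after in events[i + 1:]
                       for before, after in pairs):
                return False
        return True

For the seven events of FIG. 5 described below (enter, go to bed, lie down, get up, get out of bed, fall, leave), conforms() returns True, so the five concerned clips are merged.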

Returning to FIG. 5, the plurality of events in the plurality of consecutive event clips V0, V1+V2, V3+V4, V5, and V6 are sequentially an entry event, a going-to-bed event, a lying-down event, a get-up event, a getting-out-of-bed event, a falling event, and a leaving event. After determining that the plurality of events conform to the reasonable event sequences in Table 1, the clip assembling module 132d merges the consecutive event clips V0, V1+V2, V3+V4, V5, and V6 to obtain an event film V0+V1+V2+V3+V4+V5+V6. Since the plurality of consecutive event clips V0, V1+V2, V3+V4, V5, and V6 are merged into an event film, these consecutive event clips V0, V1+V2, V3+V4, V5, and V6 may be referred to as a plurality of concerned clips.

In some embodiments, when two event clips overlapping in time intervals are merged, the clip assembling module 132d trims, for example, the overlapping portion from the event clip sorted ahead, and then splices the event clip sorted behind onto it. When two event clips not overlapping in time intervals are merged, the clip assembling module 132d may, for example, directly place the event clip sorted behind after the event clip sorted ahead.

For example, the time interval of the event clip V0 is in a range of 12:00:00 to 12:00:15; the time interval of the event clip V1+V2 is in a range of 12:00:10 to 12:00:30; the time interval of the event clip V3+V4 is in a range of 14:00:00 to 14:00:30; the time interval of the event clip V5 is in a range of 14:00:25 to 14:00:55; and the time interval of the event clip V6 is in a range of 14:00:50 to 14:01:20. The content of the event film V0+V1+V2+V3+V4+V5+V6 from the 0th second to the 10th second is the content of the event clip V0; the content from the 10th second to the 30th second is the content of the event clip V1+V2; the content from the 30th second to the 55th second is the content of the event clip V3+V4; the content from the 55th second to the 80th second is the content of the event clip V5; and the content from the 80th second to the 110th second is the content of the event clip V6. That is to say, a video file from 12:00:00 to 14:01:20 in the room is condensed into a 110-second event film including the falling event. However, the present invention does not limit a specific manner of merging herein, and those of ordinary skill in the art can implement it according to different needs.
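
The 110-second figure can be reproduced with a short calculation; the sketch below assumes the clips are already sorted by starting time and that the clip sorted ahead is trimmed wherever two adjacent time intervals overlap:

    def merged_film_length(clips):
        # Each clip contributes its full length, except that the part of
        # a clip overlapped by the following clip is played only once.
        total = 0.0
        for current, following in zip(clips, clips[1:]):
            total += min(current.end, following.start) - current.start
        total += clips[-1].end - clips[-1].start
        return total

With the five time intervals above expressed in seconds from 12:00:00 (0 to 15, 10 to 30, 7200 to 7230, 7225 to 7255, and 7250 to 7280), the function returns 110, matching the 110-second event film.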

It is worth mentioning that, in order to facilitate the description of the details of assembling event clips, five consecutive event clips conforming to the merging rule, namely the event clips V0, V1+V2, V3+V4, V5, and V6, are exemplarily selected in the embodiment of FIG. 5 as an example. It must be understood that the plurality of event clips obtained by the processor 131 in step S304 may also include a plurality of other consecutive event clips, which may also be assembled into an event film in the manner described in the previous paragraphs. In other words, after sorting is performed according to the starting time, as long as a plurality of events in a plurality of consecutive event clips conforms to the merging rule, the clip assembling module 132d merges these consecutive event clips together.

In some embodiments, the merging rule also stipulates that all events in a plurality of concerned clips must correspond to the same person, that is, each event in an event film occurs on the same person.

FIG. 6 is a schematic diagram of obtaining an event film by assembling a plurality of concerned clips including different people according to an embodiment of the present invention.

Referring to FIG. 6, event clips A0, B0, and B1 are, for example, extracted by the clip extracting module 132c from the video files captured by the camera 110A, where the event clip A0 includes an entry event of a person A, the event clip B0 includes an entry event of a person B, and the event clip B1 includes a leaving event of the person B. The event clip A1+A2 is, for example, an assembling result of the clip assembling module 132d, including a going-to-bed event and a lying-down event of the person A.

In some embodiments, the processor 131 uses, for example, image analysis to detect the person corresponding to an event. For example, the processor 131 may first use face detection to identify the person corresponding to an event. For another example, the processor 131 may also estimate the height of a person in the image, or use an image histogram to obtain the color of clothing, and the like, to determine whether the detected events correspond to the same person or not. In other words, the present invention does not limit a specific manner of judging the person corresponding to an event.
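
As one hedged illustration of the clothes-color judgment (face detection and height estimation would be handled analogously), two clothing-region histograms may be compared as follows; the correlation measure and the threshold are assumptions of the sketch, not part of the embodiments:

    import numpy as np

    def clothes_color_signature(clothing_region, bins=16):
        # Build per-channel color histograms of the cropped clothing
        # region of an image as a rough appearance signature.
        histograms = [np.histogram(clothing_region[..., channel],
                                   bins=bins, range=(0, 255),
                                   density=True)[0]
                      for channel in range(3)]
        return np.concatenate(histograms)

    def same_person(signature_a, signature_b, threshold=0.9):
        # Treat two detected events as corresponding to the same person
        # when their appearance signatures are strongly correlated.
        a = signature_a - signature_a.mean()
        b = signature_b - signature_b.mean()
        correlation = float(np.dot(a, b) /
                            (np.linalg.norm(a) * np.linalg.norm(b)))
        return correlation >= threshold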

In the present embodiment, the clip assembling module 132d assembles event clips according to the person corresponding to an event. Since the events in the event clip A0 and the event clip B0 correspond to different people, although the time intervals of the event clips A0 and B0 overlap, the clip assembling module 132d does not assemble the event clip A0 and the event clip B0. Since the events in the event clip A0 and the event clip A1+A2 both correspond to the person A, and the entry event, the going-to-bed event, and the lying-down event conform to a reasonable event sequence, the clip assembling module 132d merges the event clip A0 and the event clip A1+A2 into an event film A0+A1+A2. On the other hand, since the events in the event clip B0 and the event clip B1 both correspond to the person B, and the entry event and the leaving event conform to a reasonable event sequence, the clip assembling module 132d merges the event clip B0 and the event clip B1 into another event film B0+B1.

In this way, the processor 131 can merge a plurality of consecutive event clips conforming to the merging rule among the plurality of event clips into an event film. Finally, the processor 131 outputs the event film and the event clips that are not merged (step S308). In detail, the film outputting module 132e outputs, in addition to the event film generated by the clip assembling module 132d, the other event clips that are not merged into the event film. For example, the film outputting module 132e uploads and stores the event film and the event clips not merged into the event film to the cloud database 140; the event film and the event clips uploaded to the cloud database 140 may be presented, for example, in the form of a list in a web page, and a user can click to download the event film or the event clips, or view them online.

In some embodiments, before outputting the event films and the event clips, the film outputting module 132e may first name each event film and event clip according to the events therein. For example, the processor 131 presets an importance ranking of various events, and names an event film or event clip according to the event with the highest importance ranking therein. For example, in FIG. 5, the importance ranking of the falling event is the highest, so the event film V0+V1+V2+V3+V4+V5+V6 is named according to the falling event, for example, "falling at 14:00:40", according to the falling event and its occurring time. However, the present invention does not limit the naming manner herein. In other embodiments, the output event films and event clips may also be named in other manners.
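
A sketch of this naming step follows; the importance ranking shown (with the falling event ranked highest) and the time formatting are assumptions made for illustration:

    # Preset importance ranking: earlier in the list means more important.
    IMPORTANCE = ["fall", "enter", "leave", "go to bed",
                  "get out of bed", "get up", "lie down", "bed activity"]

    def name_film(events_with_times):
        # events_with_times: list of (event, "HH:MM:SS") pairs; the film
        # is named after its most important event and that event's
        # occurring time, e.g. "fall at 14:00:40" for FIG. 5.
        event, occurring_time = min(
            events_with_times,
            key=lambda pair: IMPORTANCE.index(pair[0]))
        return f"{event} at {occurring_time}"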

Based on the foregoing, the video file processing method, the video file processing device, and the monitoring system provided by the embodiments of the present invention can analyze a plurality of event clips from video files captured by one or more cameras, and organize a portion of the event clips into an event film through a well-defined merging rule. Accordingly, a user can find required segments more rapidly, thereby improving convenience and user experience.

Although the present invention has been disclosed with the foregoing embodiments, they are not intended to limit the present invention, and it is possible for any person of ordinary skill in the art to make some modifications and refinements without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention should be defined by the scope of the appended claims.

What is claimed is:
1. A video file processing method, comprising:
acquiring a first video file including at least one first event from a first camera and acquiring a second video file including at least one second event from a second camera, wherein the first camera and the second camera are configured to capture environments of different event positions to produce the first video file and the second video file respectively;
detecting the at least one first event from the first video file and detecting the at least one second event from the second video file;
detecting a first occurring time of the at least one first event in the first video file and detecting a second occurring time of the at least one second event in the second video file;
extracting at least one first event clip corresponding to the at least one first event from the first video file by acquiring a first time period before and after the first occurring time of the at least one first event;
extracting at least one second event clip corresponding to the at least one second event from the second video file by acquiring a second time period before and after the second occurring time of the at least one second event;
determining whether a time difference between starting times of any two of extracted first event clips or between starting times of any two of extracted second event clips is less than a preset threshold value;
finding an overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips;
generating an assembled event clip by both summing the any two of extracted first event clips or the any two of extracted second event clips and removing the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips, in response to the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips having been found;
obtaining a merged event film by assembling a plurality of concerned clips according to a merging rule, wherein the plurality of concerned clips comprise one or more of the at least one first event clip, the at least one second event clip, and the assembled event clip, and each of the plurality of concerned clips included in the merged event film corresponds to different predetermined events; and
outputting the merged event film.
2. The video file processing method according to claim 1, wherein the merging rule comprises a plurality of pre-defined reasonable event sequences, and the different predetermined events in the merged event film conform to the pre-defined reasonable event sequences.
3. The video file processing method according to claim 1, wherein the different predetermined events occur and are detected in the first video file and the second video file.
4. The video file processing method according to claim 1, wherein the step of obtaining the merged event film by assembling the plurality of concerned clips according to the merging rule comprises:
sorting the at least one first event clip and the at least one second event clip according to a starting time thereof;
determining whether the different predetermined events in a plurality of consecutive event clips in the at least one first event clip and the at least one second event clip conform to the merging rule; and
obtaining the merged event film by merging, in response to a determining result indicating that the different predetermined events in the consecutive event clips conform to the merging rule, the consecutive event clips.
5. The video file processing method according to claim 1, wherein the different predetermined events comprised by the concerned clips correspond to a same person.
6. The video file processing method according to claim 1, wherein the step of outputting the merged event film comprises: uploading and storing the merged event film into a cloud database.
7. A video file processing device, comprising:
a storage element, storing a plurality of instructions; and
a processor, coupled to the storage element and configured to load the instructions, wherein when the instructions are executed, the processor functions as:
a video file acquiring module, configured to acquire a first video file including at least one first event from a first camera and acquire a second video file including at least one second event from a second camera, wherein the first camera and the second camera are configured to capture environments of different event positions to produce the first video file and the second video file respectively;
an event detecting module, configured to detect the at least one first event from the first video file and detect the at least one second event from the second video file, and to detect a first occurring time of the at least one first event in the first video file and detect a second occurring time of the at least one second event in the second video file;
a clip extracting module, configured to extract at least one first event clip corresponding to the at least one first event from the first video file by acquiring a first time period before and after the first occurring time of the at least one first event, and extract at least one second event clip corresponding to the at least one second event from the second video file by acquiring a second time period before and after the second occurring time of the at least one second event;
a clip assembling module, configured to: determine whether a time difference between starting times of any two of the extracted first event clips or between starting times of any two of the extracted second event clips is less than a preset threshold value; find an overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips; generate an assembled event clip by both summing the any two of extracted first event clips or the any two of extracted second event clips and removing the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips, in response to the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips having been found; and obtain a merged event film by assembling a plurality of concerned clips according to a merging rule, wherein the plurality of concerned clips comprise one or more of the at least one first event clip, the at least one second event clip, and the assembled event clip, and each of the plurality of concerned clips included in the merged event film corresponds to different predetermined events, wherein the merging rule comprises a plurality of pre-defined reasonable event sequences, and the different predetermined events in the merged event film conform to the pre-defined reasonable event sequences; and
a film outputting module, configured to output the merged event film.
8. The video file processing device according to claim 7, wherein the event detecting module detects the different predetermined events which occur in the first video file and the second video file.
9. The video file processing device according to claim 7, wherein the clip assembling module is configured to: sort the at least one first event clip and the at least one second event clip according to a starting time thereof; determine whether the different predetermined events in a plurality of consecutive event clips in the at least one first event clip and the at least one second event clip conform to the merging rule; and obtain the merged event film by merging, in response to a determining result indicating that the different predetermined events in the consecutive event clips conform to the merging rule, the consecutive event clips.
10. The video file processing device according to claim 7, wherein the different predetermined events comprised by the concerned clips correspond to a same person.
11. The video file processing device according to claim 7, wherein the film outputting module uploads and stores the merged event film into a cloud database.
12. A monitoring system, comprising:
a first camera, configured to capture a first video file including at least one first event;
a second camera, configured to capture a second video file including at least one second event, wherein the first camera and the second camera are configured to capture environments of different event positions to produce the first video file and the second video file respectively; and
a video recording device, coupled to the first camera and the second camera, and configured to:
receive and record the first video file and the second video file;
detect the at least one first event from the first video file and detect the at least one second event from the second video file;
detect a first occurring time of the at least one first event in the first video file and detect a second occurring time of the at least one second event in the second video file;
extract at least one first event clip corresponding to the at least one first event from the first video file by acquiring a first time period before and after the first occurring time of the at least one first event;
extract at least one second event clip corresponding to the at least one second event from the second video file by acquiring a second time period before and after the second occurring time of the at least one second event;
determine whether a time difference between starting times of any two of the extracted first event clips or between starting times of any two of the extracted second event clips is less than a preset threshold value;
find an overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips;
generate an assembled event clip by both summing the any two of extracted first event clips or the any two of extracted second event clips and removing the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips, in response to the overlapped time segment between the any two of extracted first event clips or between the any two of extracted second event clips having been found;
obtain a merged event film by assembling a plurality of concerned clips according to a merging rule, wherein the plurality of concerned clips comprise one or more of the at least one first event clip, the at least one second event clip, and the assembled event clip, and each of the plurality of concerned clips included in the merged event film corresponds to different predetermined events, wherein the merging rule comprises a plurality of pre-defined reasonable event sequences, and the different predetermined events in the merged event film conform to the pre-defined reasonable event sequences; and
output the merged event film.
13. The monitoring system according to claim 12, wherein the video recording device comprises a network video recorder and a video file processing device coupled to each other, the first video file and the second video file captured by the first camera and the second camera are recorded in the network video recorder, and the video file processing device is able to acquire the first video file and the second video file from the network video recorder and generate the merged event film by performing an assembling operation.
14. The monitoring system according to claim 13, further comprising: a cloud database, wherein the video file processing device is coupled to the cloud database, and configured to upload and store the merged event film into the cloud database.